DC Proposal: Model for News Filtering with Named Entities
Abstract
In this paper we introduce the project of our PhD thesis: a model for filtering news articles. We propose a framework that combines article texts with information about the named entities extracted from them. The named entities are enriched with additional attributes crawled from semantic web resources, and these attributes are then used to improve the filtering results. We describe various ways of creating a user profile with our model, which should enable news filtering that covers specific user needs. We report on some preliminary experiments and propose a complex experimental environment and different measures.
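The abstract gives no formulas, so the following is only a minimal sketch of how article text and enriched entity attributes might be combined into one filtering score. Everything in it is an assumption made for illustration: the `Article` and `UserProfile` classes, the use of Jaccard overlap, the `entity_weight` parameter, the `threshold` value, and the `type:Politician`-style attribute strings are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class Article:
    """Hypothetical article: its text terms plus enriched entity attributes."""
    terms: Set[str]
    entity_attributes: Set[str] = field(default_factory=set)


@dataclass
class UserProfile:
    """Hypothetical profile: terms and entity attributes the user is interested in."""
    terms: Set[str]
    entity_attributes: Set[str] = field(default_factory=set)


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard overlap of two sets; 0.0 when both sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def score(article: Article, profile: UserProfile, entity_weight: float = 0.5) -> float:
    """Weighted mix of text similarity and entity-attribute similarity (assumed scheme)."""
    text_sim = jaccard(article.terms, profile.terms)
    entity_sim = jaccard(article.entity_attributes, profile.entity_attributes)
    return (1.0 - entity_weight) * text_sim + entity_weight * entity_sim


def filter_articles(articles: List[Article], profile: UserProfile,
                    threshold: float = 0.2) -> List[Article]:
    """Keep only the articles whose combined score reaches the threshold."""
    return [a for a in articles if score(a, profile) >= threshold]


# Example data: attribute strings stand in for properties crawled from the semantic web.
profile = UserProfile(terms={"election", "vote"},
                      entity_attributes={"type:Politician", "country:France"})
politics = Article(terms={"election", "result"},
                   entity_attributes={"type:Politician"})
sports = Article(terms={"football"}, entity_attributes={"type:Athlete"})

selected = filter_articles([politics, sports], profile)
```

In this sketch the entity attributes let an article match the profile even when few words overlap, which is one possible reading of how the enriched attributes could improve the filtering results.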
Similar resources
PAYMA: A Tagged Corpus of Persian Named Entities
The goal in the named entity recognition task is to classify proper nouns of a piece of text into classes such as person, location, and organization. Named entity recognition is an important preprocessing step in many natural language processing tasks such as question-answering and summarization. Although many research studies have been conducted in this area in English and the state-of-the-art...
Graphical Viewing of Relationships Extracted from Online Articles
This paper discusses an approach to extracting and viewing relationships between named entities from news articles. It covers the preprocessing, parsing, extraction, filtering, and visualization of this information, starting from online news articles and ending with a visual graph of concepts they contain.
Named Entity Trends Originating from Social Media
There have been many studies on finding what people are interested in at any time through analysing trends in language use in documents as they are published on the web. Few, however, have sought to consider material containing subject matter that originates in social media. The work reported here attempts to distinguish such material by filtering out features that trend primarily in news media....
Named Entity Recognition in Chinese News Comments on the Web
News comment is a new text genre in the Web 2.0 era. Many people often write comments to express their opinions about recent news events or topics after they read news articles. Because news comments are freely written without checking, they are very different from formal news texts. In particular, named entities in news comments are usually composed of some wrongly written words, informal abbr...
Named Entity Discovery Using Comparable News Articles
In this paper we describe a way to discover Named Entities by using the distribution of words in news articles. Named Entity recognition is an important task for today’s natural language applications, but it still suffers from data sparseness. We used an observation that a Named Entity often appears synchronously in several news articles, whereas a common noun doesn’t. Exploiting this charac...
Publication date: 2011